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The  Annual  Report  for  Award  Number  DAMD17-97-1-7070 

The  Proposal  Title:  The  Use  of  a  New  Technique  to  Study  DNA  Methylation  in  Breast 
Cancer 

The  overall  goal  of  the  proposal  funded  by  the  Department  of  Defense  Breast  Cancer 
Research  Program,  is  to  understand  the  role(s)  played  by  DNA  methylation  in  human  breast 
cancer  development.  To  reach  this  objective,  our  specific  aims  are: 

1.  to  develop  and  refine  a  new  technique.  Methyl  Differential  Display  (MDD),  to  study 
the  role(s)  of  DNA  methylation  in  human  breast  cancer  development. 

2.  to  apply  this  technique  to  isolate  genomic  markers  which  detect  altered  DNA 
methylation  patterns  in  breast  cancer  cells. 

3.  to  search  for  new  types  of  oncogenes  and  recessive  oncogenes  whose  expressions  are 
under  the  control  of  DNA  methylation  machinery  by  differentially  methylated 
genomic  DNA  fragments  (DMGFs). 

4.  to  determine  biological  function(s)  of  the  candidate  genes. 

5.  to  search  for  their  (genomic  markers  and  candidate  genes)  potential  use  in  clinical 
diagnosis  and  prognosis. 

In  the  past  year,  our  studies  progress  has  included  the  following  achievements:  1).  To 
search  for  a  coding  region,  a  DMGF,  BR50,  was  used  as  a  probe  to  screen  a  human  genomic 
DNA  phage  library,  and  a  longer  DNA  fragment,  containing  at  least  two  exons,  was 
discovered;  2).  Based  on  the  sequence  information  of  those  two  exons,  a  700  bp  coding 
region  was  generated  by  PCR  amplification;  3).  Northern  analysis  demonstrated  that  the 
target  gene  was  highly  expressed  in  human  testes;  4).  The  full  length  gene,  TSP50,  was  then 
acquired  from  a  human  testes  cDNA  library;  5).  Homologous  analysis  revealed  that  the  new 
gene  could  be  a  serine  protease;  6).  Methylation  sensitive  southern  blots  suggest  that 
expression  of  TSP50  could  be  regulated  by  DNA  methylation;  and  7).  The  TSP50  gene  was 
abnormally  activated  in  some  breast  cancer  patients. 

In  the  following  paragraphs,  a  detailed  report  of  our  achievements  will  be  presented. 
The  overall  achievements  related  to  the  TSP50  gene  have  been  published  in  the  journal  of 
Cancer  Research  (1,  see  Appendix). 

Materials  and  Methods 

Northern  analysis.  Two  Human  Multiple  Tissue  Northern  Blot  panels,  MTN™,  and 
MTN™  II,  were  purchased  from  Clontech  Inc.  The  MTN™  blot  contains  approximately  2 
pg  of  polyA+  RNA  per  lane  from  eight  different  human  tissues  (heart,  brain,  placenta,  lung, 
liver,  skeletal  muscle,  kidney,  and  pancreas).  The  MTN™  II  blot  contains  the  same 
amounts  of  mRNA  from  an  additional  eight  different  human  tissues  (spleen,  thymus, 
prostate,  testes,  ovary,  small  intestine,  colon,  and  peripheral  blood  leukocyte).  The  probes 
labeling  and  detection  were  the  same  as  above. 
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Human  complimentary  DNA  (cDNA)  library  screening.  A  human  testes  Xgtll  cDNA 
library  (Human  Testis  cDNA  library,  Clontech.  Inc.)  was  used  to  obtain  an  intact  gene 
following  the  instructions  of  the  manufacturer.  2  X  105  plaques  evenly  distributed  on  6 
plates  (150  X  15  mm)  were  transferred  onto  Hybond  N-membranes.  Southern  analysis  was 

performed  as  before.  The  phage  DNA,  with  human  cDNA  insert,  was  purified  by  the  X 
Quick!  Spin  Kit  (BIO  101,  Inc.)  following  the  instructions  of  the  manufacturer.  The 
individual  insert  was  released  by  restriction  enzyme  Eco  RI  cleavage  from  the  phage  DNA 
arms,  then  subcloned  into  pUC118  plasmid. 

Reverse  Transcription-PCR  (RT-PCR).  Total  RNAs  were  isolated  from  paired  breast  cancer 
and  normal  tissues  by  RNA  isolation  kit,  RNA  STAT-60  (TEL-TEST,  Inc.).  The  first  strand 
cDNA  was  synthesized  by  Superscript  Preamplification  system  kit  (GIBCOBRL,  Life 
Technologies).  Oligomers  E  (ACCAGAGCGTCCAGTGTGTCC,  sense)  and  F 
(TGGGACTTGATGATCTGAACC,  antisense)  were  used  to  synthesize  the  TSP50  gene.  The 
predicted  size  was  699  bp.  fi-actin  was  used  as  an  internal  control  whose  sense  and  antisense 
primers  were:  5'-G  ACGACAT  GGAGA  AGA  TCTGG-3 '  and  5'- 

TGTAGAGGTAGTCAGTCAGG-3'.  The  predicted  size  for  fi-actin  was  335  bp.  The  PCR 
reaction  mixture  consisted  of  cDNA  derived  from  125  ng  of  RNA,  10  pmole  of  sense  and 
antisense  primers  from  both  TSP50  and  fi-actin,  200  p,M  of  four  deoxynucleotide 
triphosphate,  and  0.125  unit  of  Taq  DNA  polymerase  with  reaction  buffer  (Perkin  Elmer)  in 
a  final  volume  of  25  pi.  Thirty  eight  cycles  of  PCR  were  carried  out.  Each  cycle  of  PCR 
included  30  seconds  of  denaturation  at  95  °C,  60  seconds  of  annealing  at  60  °C,  and  60 
seconds  of  extension  at  72  oC.  The  PCR  products  were  separated  on  a  2%  agarose  gel. 

Results 

Search  for  the  coding  regions  by  probe  BR50.  Based  on  our  preliminary  analysis,  BR50  was 
an  interesting  probe  to  be  used  for  an  adjacent  gene  search  because  it  was  differentially 
methylated  in  some  breast  and  ovarian  cancer  samples  tested,  even  though  the  sample  size 
was  small.  BR50  is  a  genomic  fragment,  and  consequently  the  chances  that  it  encodes  a 
polypeptide  are  slim  because  a  majority  of  human  genomic  DNA  sequences  are  non-coding 
sequences.  As  a  result,  we  decided  that  the  first  step  in  searching  for  a  gene  should  be  to 
isolate  a  longer  DNA  piece  from  a  human  genomic  DNA  library,  with  the  hope  that  it  may 
contain  exon(s).  It  is  well  known  that  DNA  methylation  sites  can  be  near  genes,  therefore, 
we  decided  to  screen  a  human  placenta  genomic  phage  library,  EMBL3  SP6/T7  (Clontech, 
Inc.),  where  the  insertion  sizes  were  relatively  small  (9  to  28  kbp).  For  this  purpose,  2  X  106 
plaques  derived  from  the  library  were  screened  by  probe  BR50.  As  a  result,  a  17  kbp  length 
clone  was  isolated.  To  obtain  the  sequence  information,  the  DNA  clone  was  released  from 
the  phage  DNA  arms  by  restriction  enzyme  Sst  I  cleavage  which  generated  eight  DNA 
fragments.  All  eight  fragments  were  subcloned  into  pUC118  plasmids.  The  fragments 
smaller  than  1  kbp  were  completely  sequenced,  while  the  fragments  larger  than  1  kbp  were 
partially  sequenced  from  both  the  plasmid  and  insert  junctions.  A  homolog  search  of  the 
NIH  GeneBank  revealed  two  exons,  BR50-44  and  BR50-45,  which  contained  112  and  132 
nucleotides,  respectively.  Both  exons  encoded  polypeptides  which  were  about  50% 
identifiable  to  several  mammalian  proteases,  such  as  serine  proteases  and  tryptases.  The 
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BR50-45  sequence  was  found  142  bps  up  stream  of  the  BR50  sequence. 

Since  we  did  not  know  the  exact  positions  of  the  two  exons  in  the  17  kbp  fragment,  it 
was  possible  that  other  exon(s)  might  lay  between  them.  To  gain  a  longer  coding  sequence, 
we  designed  four  oligomers  (A,  B,  C,  D)  based  on  both  exon's  sequence  information  to 
perform  PCR.  Oligomer  A  (5'-CCTGGATGGTCAGCGTG-3')  and  B  (5'- 
CTGGGAGGCAATGATGGT-3'),  which  were  on  the  complimentary  strand,  were  based  on 
the  sequence  information  of  BR-44;  and  C  (5'-CTGGAGAGCCCTTGGTCT-3')  and  D  (5'- 
CAGTGTTGGTAGGAGGAG-3'),  which  were  on  the  complimentary  strand,  were  based  on 
the  sequence  information  of  BR-45.  A  strategy  using  four  different  combinations  of 
oligomer  pairs  was  employed  to  perform  PCR  by  utilizing  the  Human  Universal  cDNA 
Library  Panel  (Clontech  Inc.).  A  PCR  product  which  was  about  700  bps  in  length  was 
generated  from  one  oligomer  combination  (A/D,  5'-CCTGGATGGTCAGCGTG-3'/5'- 
CAGTGTTGGTAGGAGGAG-3').  This  PCR  product  was  directly  sequenced.  Combining  the 
sequence  information  of  the  PCR  product  and  the  two  exons,  we  obtained  a  cDNA  fragment 
which  contained  974  bps.  The  DNA  homolog  search  of  the  NIH  GeneBank  revealed  again, 
that  it  coded  for  a  protease  like  protein,  and  the  overall  identity  was  approximately  40%. 

The  candidate  gene  is  highly,  and  specifically  expressed  in  human  testes.  To  obtain  a  full 
length  cDNA,  it  is  critical  to  use  the  right  cDNA  library  where  the  gene  of  interest  is 
expressed.  Thus,  two  Human  Multiple  Tissue  Northern  Blot  panels,  MTN™  and  MTN™ 
II,  containing  16  different  tissue  mRNAs  (Clontech  Inc.),  were  used  to  test  the  expression  of 
the  candidate  gene  by  using  the  700  bp  cDNA  PCR  product  as  a  probe.  The  results  showed 
that  there  were  no  visible  transcripts  of  this  gene  in  the  eight  mRNAs  (heart,  brain, 
placenta,  lung,  liver,  skeletal  muscle,  kidney,  and  pancreas)  included  in  the  MTN™  panel 
(data  not  shown).  In  the  MTNTM  n  panel,  a  1.7  kb  band  was  heavily  hybridized  by  the  probe 
exclusively  in  the  testes  mRNA  as  compared  to  the  control  probe,  which  was  the  human 
rab6  gene  (Fig.  1).  These  results  suggested  that  the  gene  that  we  were  searching  for  is  a 
tissue  specific  gene.  We  have  named  the  gene  TSP50  (Testes  Specific  Protease).  At  this 
moment,  the  gene's  biological  function(s)  in  human  testes  remains  unknown. 

Isolation  of  the  full  length  TSP50  gene  from  a  human  testes  cDNA  library.  Since  TSP50  is 
highly  expressed  in  human  testes,  to  search  for  the  full  length  TSP50  gene,  a  human  testes 
cDNA  library  (Human  Testis  cDNA  library,  Clontech.  Inc.)  was  screened  by  the  700  bp  TSP50 
cDNA  sequence.  A  cDNA  clone  containing  the  probe  sequence  was  isolated.  Sequence 
analysis  suggests  that  this  fragment  encodes  a  protein  with  385  animo  acids  (Fig.  2.).  There 
is  a  stop  codon  located  at  the  117th  bp  up  stream  of  the  initial  translation  site,  and  there  is  a 
125  bp  untranslated  region  before  a  polyadenosine  sequence.  These  results  imply  that  a  full 
length  gene  has  been  obtained.  It  is  also  notable  that  the  BR50  probe  sequence  is  located  at 
the  3'  end  of  the  gene,  and  is  only  17  bps  down  stream  from  the  polyadenosine  adding 
signaling  site.  The  exons,  BR50-44  and  BR50-45,  encode  animo  acids  from  103  to  157  and  308 
to  385,  respectively.  The  3'-  untranslated  region  before  the  polyadenosine  site  is  also 
included  in  the  sequence  of  BR50-45. 

DNA  methylation  status  of  the  TSP50  gene  in  human  testes  and  other  normal  tissues.  Our 

studies  have  proven  that  TSP50  is  a  tissue  specific  gene,  and  the  methylation  patterns  in  its 
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3'-  region  were  altered  in  some  breast  and  ovarian  cancers.  It  is  also  known  that  many 
tissue  specific  genes  are  methylated,  and  this  methylation  may  regulate  their  expression  (2- 
4).  To  explore  the  possible  relationship  between  TSP50  gene  expression  and  DNA 
methylation  in  different  normal  human  tissues,  southern  analysis  was  performed.  The 
normal  tissues  tested  included  the  testes,  where  TSP50  was  expressed,  and  bladder,  blood, 
breast,  colon,  lung,  kidney,  placenta,  and  ovary  samples,  where  TSP50  was  apparently  not 
expressed.  To  perform  the  southern  analysis,  BR50  was  used  as  a  probe.  DNAs  isolated 
from  the  nine  tissues  were  digested  by  Msp  I,  and  Hpa  II,  which  is  an  isoschizomer  of  the 
Msp  I  enzyme  and  the  most  popular  enzyme  used  to  study  DNA  methylation  patterns  (5). 
Hpa  II  digestion  showed  that  in  the  testes  DNA,  two  bands,  probably  released  from  each 
allele  by  enzyme  cleavage,  were  hybridized  by  the  probe.  However,  in  the  other  tissues' 
DNAs,  the  corresponding  bands  were  either  not  hybridized,  or  hybridized  to  a  much  smaller 
degree  (Fig.  3a).  For  Msp  I  cleavage,  both  bands  were  released  in  different  tissues  to  various 
extents  (Fig.  3b).  Both  blots  utilized  a  genomic  fragment  which  did  not  detect  differential 
DNA  methylation  as  a  control  to  determine  complete  enzymatic  digestion  (Fig.  3).  These 
results  demonstrated  that  the  TSP50  gene  was  differentially  methylated  in  various  human 
tissues.  In  general,  DNA  demethylation  in  the  testes  is  correlated  with  high  levels  of  gene 
expression.  Conversely,  DNA  methylation  is  correlated  with  gene  silencing  in  the  bladder, 
blood,  breast,  colon,  lung,  kidney,  placenta,  and  ovary  tissues. 

Comparison  of  TSP50  product  sequence  with  other  serine  proteases.  The  sequence  analysis 
revealed  that  the  TSP50  gene  encodes  a  protein  which  shares  approximately  40%  identity 
with  mammalian  serine  proteases.  Figure  ^compares  the  TSP50  animo  acid  sequence  with 
7  other  serine  proteases  including  prostasin  (6),  plasma  kallikrein  (7),  coagulation  factor  XI 
(8),  fi-tryptase  (9),  hepsin  (10),  plasminogen  (11),  and  acrosin  (12).  Proteolytic  enzymes 
dependent  on  a  serine  residue  for  catalytic  activity  are  widespread  and  very  numerous. 
Serine  proteases  are  found  in  viruses,  bacteria,  and  eukaryotes,  and  they  include 
exopeptidases,  endopeptidases,  oligopeptidases,  and  omega  peptidases.  Over  20  families  of 
serine  peptidases  are  recognized  (13,14),  and  grouped  into  clans  that  may  have  common 
ancestors.  The  peptidases  of  chymotrypsin,  subtilisin,  and  carboxypeptidase  C  clans  have  in 
common  a  "catalytic  triad"  of  three  amino  acids:  Serine  (Ser,  nucleophile).  Aspartate  (Asp, 
electrophile),  and  Histidine  (His,  base).  However,  there  are  some  serine  peptidases  that 
have  distinctive  mechanisms  of  action  without  the  classic  Ser,  His,  Asp  triad.  The  multiple 
sequence  alignment  for  tsp50p  showed  that  it  contains  triad  His153  and  Asp206.  However, 
the  Ser  at  position  310  has  been  replaced  by  threonine  (Thr)  (Fig.  The  corresponding 
nucleotides  for  coding  Thr  in  the  TSP50  gene  were  ACT,  while  one  base  pair  switch,  such  as 
C  to  G,  will  result  in  a  Ser  codon,  AGT.  As  a  result,  one  may  wonder  whether  this  change 
was  caused  by  a  point  mutation  happening  in  the  cells,  or  an  error  in  DNA  sequencing. 
Based  on  our  experience,  these  assumption  are  unlikely  since  the  DNA  fragment  containing 
this  codon  was  isolated  from  two  individual  libraries,  the  human  placenta  genomic  phage 
library  and  testes  cDNA  library,  and  the  ACT  codon  was  verified  by  DNA  sequencing  in  both 
fragments.  This  implies  that  the  Ser  was  replaced  by  Thr  in  the  predicted  Ser  triad  of  tsp50p. 
However,  Thr  and  Ser  residues  are  structurally  similar  (Thr  has  an  extra  methyl  group  as 
compared  to  Ser).  Both  Thr  and  Ser  contain  the  HO  group  that  is  critical  for  enzymatic 
catalysis.  In  addition,  the  Thr  residue  in  tsp50p  was  surrounded  with  conserved  residues 
including  a  crucial  glycine  (Gly)  (Fig.  £).  Usually  the  linear  order  of  catalytic-site  residues. 
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clusters  of  conserved  amino  acids  around  the  catalytic  residues,  are  important  factors  to 
classify  a  protease  (13,14).  Therefore,  tsp50p  could  be  a  new  type  of  serine  protease,  possibly 
with  a  distinctive  mechanism  of  action. 

TSP50  was  differentially  expressed  in  some  breast  cancer  tissues.  Preliminary  results 
demonstrated  that  TSP50  was  differentially  methylated  in  40%  of  the  breast  cancer  tissues 
tested.  This  suggested  that  it  could  also  be  differentially  expressed  in  cancer.  To  test  this 
possibility,  RT-PCR  was  carried  out  to  determine  TSP50  expression  levels  in  eighteen  paired 
breast  cancer  tissues.  Our  findings  showed  that  TSP50  PCR  products  were  generated  in  five 
tumor  tissues,  while  in  their  normal  controls,  they  were  not  visible  relative  to  the  control 
gene,  fi-actin  (Fig.  tf).  Products  generated  from  the  five  patients  were  gel  purified  and 
sequenced.  DNA  sequence  analysis  confirmed  that  the  PCR  products  synthesized  were  the 
TSP50  gene  (data  not  shown).  Therefore,  among  the  eighteen  paired  samples  tested,  28%  of 
the  tissues  expressed  the  TSP50  gene.  At  this  moment,  we  can  not  answer  the  question  of 
whether  activation  of  the  TSP50  gene  in  cancer  is  a  consequence,  or  a  causal  factor  of 
neoplastic  growth.  To  find  the  truth,  it  will  be  necessary  to  perform  in  vitro  cellular 
transformation  and  in  vivo  tumor  induction  assays. 

Discussions 

Since  DMGF  generated  by  the  MDD  technique  could  be  close  to  gene  of  interest  our 
first  step  in  processing  the  gene  search  was  to  screen  a  human  genomic  phage  library.  A  17 
kbp  DNA  fragment  was  isolated  and  sequence  analysis  suggested  that  this  fragment 
contained  at  least  two  exons  which  were  homolog  to  mammalian  proteases.  The  exons' 
sequencing  information  led  to  the  discovery  of  a  974  bp  gene  fragment  from  a  human  cDNA 
library  panel  by  PCR  amplification.  To  obtain  a  full  length  gene,  a  northern  evaluation  on 
sixteen  different  types  of  human  RNAs  was  performed.  The  results  demonstrated  that  the 
target  gene  was  specifically  expressed  in  human  testes  tissue.  This  information  secured  the 
isolation  of  an  intact  gene,  TSP50,  by  screening  a  human  testes  cDNA  library.  The  sequence 
analysis  revealed  that  the  TSP50  gene  most  likely  encodes  a  serine  protease. 

It  is  well  known  that  proteases  of  all  major  classes  (i.e.,  serine,  aspartic,  cysteine, 
threonine,  and  metalloproteinases)  are  linked  with  various  malignancies,  especially  those 
exhibiting  the  metastasis  phenotype.  For  examples,  prostate  specific  antigen  (PSA)  is  a 
kallikrein-like  serine  protease  that  is  utilized  as  a  clinical  marker  for  the  diagnosis  and 
staging  of  prostate  cancer  where  its  preferential  expression  in  prostate  epithelial  cells  is 
increased  (15,16).  In  addition,  the  matrix  metalloproteinases  (MMPs)  have  been  repeatedly 
implicated  in  metastasis  (17-19).  Since  the  TSP50  gene  product  (tsp50p)  could  be  a  serine 
protease,  and  was  activated  in  breast  cancer  cells,  it  is  logical  to  proceed  to  the  next  step  and 
asks  whether  this  gene  plays  a  role  in  breast  cancer  invasiveness? 

It  is  common  knowledge  that  many  tissue  specific  genes'  expression  is  regulated  by 
DNA  methylation  which  usually  modify  the  promoter,  or  sometimes,  3'-  regions  (2-4,  20). 
Our  preliminary  results,  although  only  obtained  from  analyzing  the  DNA  methylation 
status  of  the  3'  flanking  region  of  the  gene,  have  proven  that  TSP50  is  one  of  those  tissue 
specific  genes.  It  will  be  interesting  to  discover  whether  the  gene's  promoter  region  is  also 
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methylated  when  the  corresponding  sequence  information  is  available.  The  Hpa  II  and  Msp 
I  methylation  sensitive  southern  analysis  of  the  TSP50  gene's  3'-  region  demonstrated  that, 
in  Hpa  II  digested  DNAs,  probe  BR50  hybridized  two  bands  in  the  testes  tissue,  but  none  in 
the  other  samples.  The  lower  band,  which  was  the  same  size  as  the  probe,  represented  the 
unmethylated  DNA  pattern,  while  the  upper  band  obviously  contained  the  internal  Hpa  II 
recognition  site(s)  which  remained  methylated.  In  Msp  I  digested  DNAs,  the  upper  band 
was  dominant  in  most  tissues,  while  in  the  testes,  the  lower  band  was  dominant.  These 
results  suggest  that  the  GGCCGG  end  of  BR50  was  methylated  in  other  tissues,  but  not  in  the 
testes.  The  DNA  methylation  patterns  observed  in  both  blots  are  probably  allelic  oriented. 
It  seems  that  DNA  hypomethylation  was  accompanied  by  the  gene's  expression  in  the  testes, 
and  conversely,  DNA  hypermethylation  was  accompanied  by  the  gene's  silencing  in  other 
tissues.  The  correlation  between  DNA  methylation  and  gene  expression  provided 
additional  proof  that  DNA  methylation  could  be  an  important  mechanism  in  governing 
the  genes'  expression  in  various  differentiated  human  cells. 

In  addition,  the  differential  expression  of  the  TSP50  gene  has  been  tested  in  eighteen 
paired  breast  cancer  biopsies.  Our  findings  have  shown  that  this  gene  was  activated  in  five 
cancer  samples.  This  finding  indicates  that  this  novel  gene's  expression  is  related  to  breast 
cancer  progression.  In  the  near  future,  more  samples  from  different  types  of  cancer  will  be 
examined,  and  the  possibility  that  the  TSP50  gene  product  might  be  one  of  the  factors  that 
stimulate  human  cancer  will  be  further  explored. 

GenBank  accession  numbers.  BR50:  U78781,  TSP50:  AF100707. 

Figures  and  Figure  Legends. 

Fig.  1.  Northern  blot  results  analyzed 
by  a  fragment  of  the  TSP50  gene.  The 
Human  Multiple  Tissue  Northern 
Blot  panel,  MTNTM  II,  containing  8 
different  tissues'  mRNA  (Clontech  Inc.) 
was  tested  to  determine  the  expression 
levels  of  the  gene.  Compared  to  the 
control,  the  human  Rab6  gene,  which 
was  evenly  expressed  in  all  tissues, 
TSP50  was  highly  expressed  in  the  testes 
tissue,  but  not  in  the  others.  No  single 
tissue  (total  eight)  in  another  Human 
Multiple  Tissue  Northern  Blot  panel 
was  hybridized  with  the  probe  (data  not 
shown). 


mRNA 


0) 

0 
c 
’*#— » 

</) 

0  0 
*5*  c 

o  ^  I  W  tc 

III  111 

©  o  </> 


0 


3  <= 
C  © 

Is 


6  &  o.  £  » 


TSP50 


1.7k 


■  control 
probe 


LIW  RDVIYSVRVG 

FP S  EHI1KEAYEVK 
FDG  LPLQDVWRTY 

VGP  D.VKDLAALR 

L.E  KSPRPSSYKV 

>■  VG  KNKVKbWK'GV 


3T  DYVLKDHSR . 
\A  NAS F PNG . LH 

:;k  gdtstiy.tn 

?A  SETFPPG.MP 
3P  NYWA.DRTE 

i ,  kaot.pugso.s 


|G  DSGGPLfl 
DSGGPlI 
DSGGPlI 
jG  DSGGPlI 

]g  dsggpH 
DSGGPlI 
DSGGPlI 


ith  other  serine 
d  to  the  mature 
■ein,  coagulation 
that  are  highly 
and  serine  are 
lignment. 


a 


CO 

CO 

c 

o 

c 

CO 

CO 

> 

CO 

0) 
•4— • 

0 

"D 

"O 

"O 

o 

CD 

■4—* 

c 

0 

O 

>> 

0 

c 

8> 

o 

CO 

0) 

CO 

o 

c 

CO 

“O 

XI 

o 

o 

X5 

-Q 

-5 

CL 

BR50 


control 


CO 

c 

c 

CO 

CO 

0 

l— 

0 

T5 

T5 

CO 

o 

C0 

g 

TJ 

O 

o> 

2 

o 

o 

CO 

0 

CO 

-Q 

O 

SI 

c 

=3 

c  >> 
0  0 


Fig.  3.  DNA  methylation  status  of  the 
TSP50  gene  in  nine  normal  human 
tissues  examined  by  southern  blot.  In  a 
and  b,  the  results  obtained  from  Hpa  II 
and  Msp  I  digestion,  respectively.  6  |lg 
of  DNA  isolated  from  each  tissue  were 
cleaved  by  the  enzymes  and  subjected  to 
southern  analysis  by  probe  BR50.  a.  The 
results  show  that  bands  which  are 
approximately  1  kbp  and  2  kbp  in  length 
were  released  by  Hpa  II  only  in  the 
testes  tissue,  b.  A  2  kbp  band  was 
released  by  Msp  I  in  most  tissues.  In  a 
and  b.  The  control  probe  hybridizes  a 
single  band  in  each  tissue's  DNA,  which 
provides  proof  of  complete  enzymatic 
digestion. 
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Fig.  4.  The  results  of  RT-PCR  for 
differential  expression  of  TSP50 
in  five  out  of  eighteen  breast 
o  cancer  and  normal  control  tissues 
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and  fi-actin  from  testes  cDNA. 
The  number  for  each  patient 
tested  is  listed  above  the  bracket. 
T  and  N  represent  tumor  and 
normal  tissues.  The  result  shows 
that  the  TSP50  gene  was 

abnormally  activated  in  approximately  30%  of  breast  cancer  patients. 
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ABSTRACT 

A  novel  gene,  testes-specific  protease  50  ( TSP50 ),  was  isolated  from  a 
human  testes  cDNA  library  by  using  a  genomic  DNA  probe,  BR50 .  BR50 
was  isolated  by  a  modified  representational  difference  analysis  (RDA) 
technique  due  to  its  hypomethylated  feature  in  a  breast  cancer  biopsy. 
This  altered  DNA  methylation  status  was  also  detected  by  BR50  in  other 
breast  and  some  ovarian  cancer  tissues.  The  TSP50  gene  product  is  a 
homologue  to  several  human  proteases,  which  indicates  that  it  may  encode 
a  protease-like  protein.  Northern  analysis  of  16  different  types  of  normal 
human  tissues  suggests  that  TSP50  was  highly  and  specifically  expressed 
in  human  testes,  which  indicates  that  it  might  possess  a  unique  biological 
function(s)  in  that  organ.  Methylation  status  analysis  in  normal  human 
testes  and  other  tissues  showed  a  correlation  between  DNA  methylation 
and  gene  expression.  Most  importantly,  reverse  transcription-PCR  anal¬ 
ysis  of  18  paired  breast  cancer  tissues  found  that  in  28%  of  the  cancer 
samples,  the  TSP50  gene  was  differentially  expressed.  The  possibility  that 
TSP50  may  be  an  oncogene  is  presently  under  investigation. 

INTRODUCTION 

Abnormal  DNA  methylations  (hypomethylation  or  hypermethyla- 
tion)  have  been  linked  to  various  human  diseases  including  cancers 
(1-8).  Because  methylated  DNA  sites  are  usually  close  to  genes 
(9-13),  searching  for  differentially  methylated  DNA  fragments  in 
cancer  could  pinpoint  genes  of  interest.  Consequently,  a  modified 
RDA  technique  using  human  breast  cancer  biopsies  as  starting  mate¬ 
rial  was  used  to  search  for  differentially  methylated  DNA  fragments. 
Unlike  traditional  RDA3  (14),  to  perform  modified  RDA,  the  restric¬ 
tion  enzyme  Mspl ,  which  is  sensitive  to  the  methylated  GC-rich 
sequence  GGCmCGG  (15-18),  was  used  as  a  master  enzyme  to  cleave 
genomic  DNAs  for  amplicon  preparation.  Mspl  is  a  relatively  frequent 
cutting  enzyme  (19),  which,  when  used  alone,  produces  an  amplicon 
of  high  complexity  that  can  cause  unsuccessful  subtractive  hybridiza¬ 
tion.  Hence,  a  second  restriction  enzyme,  or  “partner  enzyme,”  has 
been  incorporated  into  the  technique.  The  amplicons  can  only  be  made 
by  PCR  from  DNA  fragments  with  both  ends  cut  by  the  Mspl  enzyme. 

As  a  result  of  using  this  modified  technique,  two  DNA  fragments, 
BR50  and  BR254,  were  isolated  that  detected  DNA  hypomethylation 
in  breast  cancer.  Additional  studies  verified  that  both  fragments  also 
detected  hypomethylation  in  ovarian  cancer,  and  BR254  was  ampli¬ 
fied  in  1  of  10  breast  cancer  biopsies.  On  the  basis  of  these  findings, 
we  considered  both  fragments  good  candidates  to  search  for  genes  that 
might  be  related  to  various  malignancies.  This  report  will  focus  on 
presenting  the  detailed  studies  related  to  BR50 ,  which  covers  its  own 
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isolation  as  a  differentially  methylated  DNA  fragment,  to  its  utiliza¬ 
tion  in  the  discovery  of  a  novel  gene,  TSP50. 

Investigation  of  the  TSP50  gene  has  found  that  it  encodes  a  pro- 
tease-like  protein.  Northern  analysis  of  multiple  human  tissue  RNA 
expression  panels  showed  that  TSP50  is  a  tissue-specific  gene,  which 
was  heavily  expressed  in  human  testes.  There  were  almost  no  visible 
amounts  of  TSP50  transcript  displayed  in  the  other  15  types  of  human 
tissues  in  the  panels.  This  result  indicates  that  the  TSP 50  gene  holds 
a  special  physiological  function(s)  in  human  testes.  The  DNA  meth¬ 
ylation  status  of  the  downstream  region  of  the  gene  in  normal  human 
testes  and  eight  other  tissues  was  also  examined.  Apparently,  DNA 
methylation  silences  the  TSP50  gene  expression  in  those  eight  normal 
tissues,  whereas  DNA  demethylation  in  human  testes  could  be  a  key 
element  responsible  for  gene  expression.  Furthermore,  RT-PCR  was 
performed  to  examine  differential  expression  in  breast  cancer  and 
matched  normal  control  tissues.  We  found  that  ~28%  of  the  cancer 
samples  tested  expressed  the  TSP50  gene,  whereas  the  corresponding 
controls  did  not.  Whether  there  is  a  relationship  between  gene  expres¬ 
sion  and  cancer  development  is  presently  under  investigation. 

MATERIALS  AND  METHODS 

DNA  from  Human  Cancer  Biopsies.  Dissected  human  breast  and  ovarian 
cancer  tissues  (tumor  and  matched  normal)  were  immediately  frozen  in  liquid 
nitrogen  and  stored  at  —  70°C.  DNAs  were  isolated  from  those  tissues  by  the 
phenol  extraction  method  (20). 

Modified  RDA.  One  to  two  pg  of  DNA  (tester)  isolated  from  human  breast 
cancer  biopsies  and  their  matched  normal  DNA  (driver)  were  cleaved  with 
Mspl  (20  units/p,l;  Boehringer  Mannheim)  and  Msel  (20  units//d;  New  Eng¬ 
land  Biolabs,  Inc.)  in  a  50:p\  reaction  for  3  h.  To  prepare  tester  and  driver 
master  amplicons,  the  Mspl-  and  A/M-digested  tester  and  driver  genomic 
DNAs  were  ligated  to  1.5  pg  of  MSA24-mer  and  0.75  pg  of  MSA12-mer 
(Table  1 );  these  were  the  first  pair  of  oligonucleotide  linkers  that  only  recog¬ 
nize  the  ends  generated  by  Mspl.  The  procedures  for  amplicon  preparation 
were  performed  as  described  (14).  The  DNA  amplicons  were  then  purified  by 
phenol,  phenol/chloroform  extraction.  To  remove  the  first  set  of  linkers  from 
the  driver  amplicon,  80  pg  of  driver  amplicon  DNA  were  digested  with  the 
Mspl  enzyme  (10  units/ju.1).  To  change  the  tester  master  amplicon  DNA 
linkers,  5  pg  of  tester  master  amplicon  were  digested  with  Mspl  (20  units//u,l) 
and  ligated  to  0.6  pg  of  MSB24-mer  and  0.3  pg  of  MSB12-mer  (Table  1); 
these  were  the  second  set  of  oligonucleotide  linkers.  Subtractive  hybridization 
was  performed  as  described  (14).  The  first  round  of  difference  products  (DPI) 
were  amplified  as  described  (14).  To  prepare  the  second  round  of  subtractive 
hybridization,  3  pg  of  DPI  were  digested  with  the  restriction  endonuclease 
Mspl  (20  units//xl).  To  put  a  new  set  of  linkers  on  DPI,  0.1  pg  of  DPI  was 
mixed  with  0.6  pg  of  MSC24-mer  and  0.3  pg  of  MSC12-mer  (Table  1). 
Another  round  of  subtractive  hybridization/PCR  amplification  was  repeated. 
The  second  round  of  difference  products  (DP2)  usually  contained  several 
individual  DNA  fragments  when  electrophoresed  on  a  2%  agarose  gel.  The 
individual  DNA  fragments  were  purified  by  DNA  gel  extraction  kit  (Qiagen, 
Inc.)  and  subcloned  into  pUCl  18  vector,  which  was  linearized  by  the  restric¬ 
tion  endonuclease  AccI  and  transformed  into  Escherichia  coli  ( DH5a ).  Twelve 
cloned  inserts  were  chosen  to  be  amplified,  from  which  different-sized  probes 
were  selected  for  master  amplicon  Southern  blot.  The  candidate  probes  were 
then  further  tested  by  human  genomic  DNA  Southern  blot. 

Amplicon  DNA  Southern  Blot.  The  first  round  of  positive  probe  screen¬ 
ing  was  performed  with  amplicon  DNA  Southern  blots.  Non-Radiation  South- 
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Table  1  Sequence  of  the  oligonucleotides  for  PCR  amplification 


Primer 

Sequence 

MSA24 

5 '  - CTCGTCGTC AGGTC AGTGCTTC AC - 3 ' 

MSA12 

5 '  -CGGTGAAGCACT- 3 ' 

MSB24 

5 '  -TAGAGCCACGTAGCTGCTGTAGTC - 3 ' 

MSB12 

5 '  - CGGACTAC AGC A- 3 ' 

MSC24 

5 '  - ACCGTGGACTGGATAGGTTCAGAC - 3 ' 

MSC12 

5 '  -CGGTCTGAACCT- 3 ' 

em  Blot  and  Detection  kits  (Genius)  were  purchased  from  Boehringer  Mann¬ 
heim.  Probe  labeling  and  detection  followed  the  instructions  of  the 
manufacturer.  Two  to  three  fig  of  tester  and  driver  amplicon  DNA  were 
electrophoresed  on  a  2%  agarose  gel  and  blotted  to  positively  charged  nylon 
membranes  (Boehringer  Mannheim).  For  prehybridization,  the  membranes 
were  placed  at  68°C  for  2—4  h  in  solutions  containing  6X  SSC,  5X  Denhardt  s 
solution,  0.5%  SDS,  0.1  m  EDTA,  and  50  jug/ml  of  salmon  sperm  DNA.  Under 
the  same  conditions,  the  probes  were  added  and  hybridized  to  the  membranes 
overnight.  The  membranes  were  then  rinsed  three  times  with  2X  SSC,  1 X  Blot 
wash  (12  mM  Na2HP04,  8  mM  NaH2P04,  1.4  mM  Na4P207,  and  0.5%  SDS)  at 
68°C  and  further  washed  three  times  (30  min  each)  with  the  same  buffer  at 
68°C.  Next,  the  membranes  were  equilibrated  in  buffer  A  (100  mM  Tris’HCl, 
150  mM,  pH  7.5)  and  transferred  into  buffer  B  (2%  block  reagent  in  buffer  A), 
which  was  incubated  at  room  temperature  for  1  h.  The  membranes  were  then 
washed  2  times  for  15  min  with  buffer  A  and  equilibrated  in  buffer  C  (100  mM 
Tris*HCl,  100  mM  NaCl,  and  10  mM  MgCl2).  Before  the  membranes  were 
exposed  on  Kodak  X-OMAT  film  for  1  h,  they  were  rinsed  in  lumi-P530  for 

I  min  and  kept  in  a  plastic  sheet  protector. 

Genomic  DNA  Southern  Blot.  Genomic  DNAs  were  digested  with  a 
desired  restriction  enzyme  (20  units/pd)  and  electrophoresed  on  1.5%  agarose 
gels,  which  were  then  transferred  to  Hybond  N_  membranes  (Amersham). 
These  membranes  were  exposed  to  UV  light  to  immobilize  the  DNA.  Probes 
for  the  Southern  blot  were  labeled  with  High  Prime  DNA  labeling  kits 
(Boehringer  Mannheim)  following  the  instructions  of  the  manufacturer.  The 
procedure  for  hybridization  and  blot  wash  were  the  same  as  in  the  Amplicon 
DNA  Southern  blot  section. 

Northern  Analysis.  Two  Human  Multiple  Tissue  Northern  blot  panels, 
MTN  and  MTN  H,  were  purchased  from  Clontech,  Inc.  The  MTN  blot  contains 
~2  fig  of  poly(A)+  RNA  per  lane  from  eight  different  human  tissues  (heart, 
brain,  placenta,  lung,  liver,  skeletal  muscle,  kidney,  and  pancreas).  The  MTN 

II  blot  contains  the  same  amounts  of  mRNA  from  an  additional  eight  different 
human  tissues  (spleen,  thymus,  prostate,  testes,  ovary,  small  intestine,  colon, 
and  peripheral  blood  leukocyte).  The  labeling  and  detection  of  the  probes  were 
the  same  as  above. 

DNA  Sequence  and  Chromosome  Assignment.  The  pUC118  plasmid 
containing  the  candidate  DNA  fragment  was  sequenced  using  the  Ampli-Cycle 
sequencing  kit  (Perkin-Elmer),  under  conditions  described  by  the  manufac¬ 
turer.  Chromosome  assignment  for  the  candidate  DNA  fragment  was  deter¬ 
mined  by  genomic  Southern  blot  of  the  Hindlll  digested  monochromosomal 
human/rodent  somatic  cell  hybrid  mapping  panel  #2  (NIGMS  Human  Genetic 
Mutant  Cell  Repository)  while  it  was  used  as  a  probe.  Fine  chromosome 
mapping  was  performed  with  GeneBridge  4  Radiation  Hybrid  Panel  (Research 
Genetics,  Inc.)  by  PCR  amplification  (21). 

Human  Genomic  DNA  Library  Screening.  A  human  placenta  genomic 
phage  library,  EMBL3  SP6/T7  (Clontech,  Inc.),  was  used  for  cloning  a  longer 
genomic  fragment  containing  the  candidate  fragment.  Phage  infection  proce¬ 
dure  was  based  on  the  instructions  supplied  by  the  manufacturer.  Plaques 
(2  X  106)  were  evenly  distributed  on  20  plates  (150  X  15  mm),  then  trans¬ 
ferred  onto  Hybond  N"  membranes.  The  treatment  of  the  membranes,  prep¬ 
aration  of  the  probe,  and  the  blot  wash  were  the  same  as  that  described  in  the 
Genomic  DNA  Southern  blot  section.  The  phage  DNA,  with  human  DNA 
insert,  was  purified  by  the  Lambda  TRAP  Plus  kit  (Clontech,  Inc.)  following 
the  instructions  of  the  manufacturer.  The  individual  insert  was  released  by 
restriction  enzyme  Sstl  cleavage  from  the  phage  DNA  arms,  then  subcloned 
into  pUC118  plasmid. 

Human  cDNA  Library  Screening.  A  human  testes  Agtll  cDNA  library 
(Human  Testis  cDNA  Library;  Clontech,  Inc.)  was  used  to  obtain  an  intact 
gene  following  the  instructions  of  the  manufacturer.  Plaques  (2  X  105)  evenly 
distributed  on  six  plates  (150  X  15  mm)  were  transferred  onto  Hybond  N~ 


membranes.  Southern. analysis  was  performed  as  before.  The  phage  DNA,  with 
human  cDNA  insert,  was  purified  by  the  A  Quick!  Spin  kit  (BIO  101,  Inc.) 
following  the  instructions  of  the  manufacturer.  The  individual  insert  was 
released  by  restriction  enzyme  EcoRI  cleavage  from  the  phage  DNA  arms, 
then  subcloned  into  pUC118  plasmid. 

RT-PCR.  Total  RNAs  were  isolated  from  paired  breast  cancer  and  normal 
tissues  by  RNA  isolation  kit,  RNA  STAT-60  (TEL-TEST,  Inc.).  The  first- 
strand  cDNA  was  synthesized  by  Superscript  Preamplification  system  kit  (Life 
Technologies,  Inc.).  Oligomers  E  (ACCAGAGCGTCCAGTGTGTCC,  sense) 
and  F  (TGGGACTTGATGATCTGAACC,  antisense)  were  used  to  synthesize 
the  TSP50  gene.  The  predicted  size  was  699  bp.  fi-actin  was  used  as  an  internal 
control,  the  sense  and  antisense  primers  of  which  were  5  -GACGACATG- 
GAGA AGATCTGG-3 '  and  5 '  -TGTAGAGGT AGTC AGTC AGG-3 ' .  The  pre¬ 
dicted  size  for  $-actin  was  335  bp.  The  PCR  reaction  mixture  was  comprised 
of  cDNA  derived  from  125  ng  of  RNA,  10  pmol  of  sense  and  antisense  primers 
from  both  TSP50  and  fi-actin,  200  pM  of  four  deoxynucleotide  triphosphate, 
and  0.125  unit  of  Taq  DNA  polymerase  with  reaction  buffer  (Perkin-Elmer)  in 
a  final  volume  of  25  pi  Thirty-eight  cycles  of  PCR  were  carried  out.  Each 
cycle  of  PCR  included  30  s  of  denaturation  at  95°C,  60  s  of  annealing  at  60°C, 
and  60  s  of  extension  at  72°C.  The  PCR  products  were  separated  :on  a  2% 
agarose  gel. 

RESULTS 

Isolation  of  Hypomethylated  Sequences  from  Human  Breast 
Cancer  Biopsies.  The  DNAs  isolated  from  three  paired  human  breast 
cancer  biopsies  (tester)  and  surrounding  normal  tissues  (driver)  were 
cleaved  with  the  Mspl  and  Mse I  enzymes.  The  tester  and  driver 
amplicons  with  both  ends  cleaved  by  Mspl  were  selectively  prepared 
by  PCR  amplification  (see  “Materials  and  Methods”).  After  two 
rounds  of  DNA  hybridization/subtraction  and  PCR  amplification, 
individual  fragments  (DP2)  were  isolated  from  two  breast  cancer 
patients  (see  “Materials  and  Methods”).  The  DP2  fragments  were 
subcloned  into  the  pUC118  vector,  and  the  inserts  were  amplified  by 
PCR.  Twelve  different-sized  inserts  were  selected  from  each  modified 
RDA  and  used  as  probes  for  the  master  amplicon  Southern  blot.  Two 
probes,  BR50  and  BR254,  isolated  from  two  patients,  were  identified 
as  candidate  probes  for  additional  study.  In  this  report,  we  focus  on 
presenting  the  work  that  has  been  done  by  probe  BR50. 

Probe  BR50  was  selected  from  the  DP2  isolated  from  breast  cancer 
patient  no.  14’ s  biopsy  by  the  modified  RDA  technique  (Fig.  1  a) 
because  it  hybridized  a  band  of  much  greater  intensity  in  the  tester 
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Fig.  1.  a,  the  agarose  gel  electrophoresis  of  the  final  difference  products  isolated  by  a 
modified  RDA  technique  from  the  breast  cancer  biopsy  of  patient  no.  14.  Lane  M,  Hae III 
</>174  DNA  size  markers  in  bp.  Lane  A,  DNA  fragments  from  which  BR50  was  isolated. 
b,  the  amplicon  Southern  Blot  results  for  BR50.  Left  lane,  probe  BR50  hybridizes  to  itself 
as  a  positive  control.  In  Lanes  T  and  D,  probe  BR50  hybridizes  with  2  pg  of  tester  (tumor) 
and  driver  (normal)  master  amplicon  DNA  prepared  from  breast  cancer  patient  no.  14.  The 
tester  master  amplicon  DNA  displays  a  much  heavier  hybridized  band  than  the  driver 
master  amplicon  DNA.  The  picture  below  the  Southern  blot  is  the  tester  and  driver 
amplicon  agarose  gel  electrophoresis  before  transferring  it  onto  the  blot  membrane,  which 
served  as  the  DNA  quantitative  control. 
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amplicon  than  that  in  the  driver  amplicon  (Fig.  16).  To  confirm  the 
differences  observed  in  the  tester  and  driver  amplicons,  a  Southern 
analysis  was  performed  on  patient  no.  14’ s  tumor  and  matched  normal 
genomic  DNAs.  Six  fig  of  each  DNA  were  cleaved  with  the  Mspl 
enzyme  and  hybridized  by  probe  BR50.  The  results  showed  that  in  the 
tumor  DNA,  the  probe  hybridized  a  lower  band,  ~1  kbp  long,  of 
much  greater  intensity  than  an  upper  band,  which  was  ~ 2  kbp  in 
length.  In  the  normal  DNA,  just  the  opposite  occurred  (Fig.  2). 
Because  the  sizes  of  the  upper  and  lower  bands  in  the  tumor  and 
normal  control  DNAs  were  the  same,  the  only  reasonable  explanation 
causing  uneven  hybridization  intensities  is  DNA  hypomethylation  in 
the  tumor  cells.  To  examine  whether  the  event  also  existed  in  other 
breast  cancer  patients,  paired  tumor  and  normal  DNAs  isolated  from 
additional  breast  cancer  biopsies  were  cleaved  with  Mspl  and  sub¬ 
jected  to  Southern  analysis.  The  results  showed  that  of  10  samples 
tested,  4  had  similar  hybridization  patterns  to  those  of  patient  no.  14 
(Fig.  2).  It  is  notable  that  in  the  normal  DNAs  of  patient  nos.  3, 4,  and 
5,  more  than  one  upper  band  was  evident.  We  believe  this  can  be 
attributed  to  partial  DNA  demethylation,  instead  of  incomplete  enzy¬ 
matic  digestion.  This  is  because  the  Southern  membrane  was  reblotted 
by  a  control  probe,  which  was  a  background  probe  isolated  along  with 
probe  BR50,  and  only  a  single  hybridized  band  was  displayed  in  each 
lane  (Fig.  2). 

BR50  Also  Detected  Altered  Methylation  Patterns  in  Ovarian 
Cancer.  To  examine  whether  the  hypomethylation  event  also  oc¬ 
curred  in  human  ovarian  cancer,  paired  DNA  samples  isolated  from 
eight  ovarian  cancer  patients  were  analyzed  by  probe  BR50.  The 
DNAs  were  cleaved  with  Mspl  and  hybridized  with  probe  BR50  in  a 
Southern  blot  experiment.  The  results  demonstrated  that  of  eight 
patients  tested,  four  displayed  similar  hybridization  patterns  to  those 
observed  in  the  breast  cancer  samples.  In  the  tumor  DNAs,  the  lower 
band  was  heavily  hybridized  by  the  probe,  whereas  in  the  normal 
control  DNAs,  a  lower  and  upper  band  were  hybridized.  The  com¬ 
pletion  of  DNA  digestion  was  confirmed  by  the  same  control  probe 
used  before,  which  only  hybridized  a  single  band  in  each  lane  (Fig.  3). 
Thus,  BR50  also  detected  altered  DNA  methylation  in  ovarian  cancer. 
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Fig.  2.  The  breast  cancer  genomic  DNA  Southern  blot  for  probe  BR50.  Left  lane  of 
each  blot,  probe  BR50  hybridizes  to  itself  as  a  positive  control.  BR50  hybridizes  with 
A/spI-digested  6  /xg  of  original  tumor  ( 7 )  and  matched  normal  (AO  genomic  DNAs  isolated 
from  patient  no.  14  and  other  breast  cancer  patients.  The  number  for  each  patient  tested 
is  listed  above  the  bracket.  Similar  hypomethylation  patterns  in  patient  no.  14  and  the 
other  patients  are  observed.  The  control  probe  in  the  lower  section  served  as  an  indicator 
of  complete  enzymatic  digestion. 


OV2  OV9  OV12  OV14 

i - ii - ii - ii - 1 

TNTNTNTN 


f 


* 


control 


Fig.  3.  The  different  methylation  patterns  displayed  by  probe  BR50  in  ovarian  cancer. 
The  number  for  each  patient  is  listed  above  the  bracket.  Lanes  T  and  N  include  6  pg  of 
tumor  DNA  and  normal  DNA  digested  with  Mspl,  respectively.  Tumor  and  matched 
normal  DNAs  were  cleaved  with  Mspl  and  probed  by  BR50  in  a  Southern  blot  experiment. 
Methyl-differential  patterns  are  detected  by  BR50.  The  completion  of  the  enzyme  diges¬ 
tion  was  confirmed  by  the  control  probe,  which  only  hybridized  a  single  band  in  each  lane. 


Sequencing  and  Chromosome  Assignment  of  Probe  BR50. 

DNA  sequencing  found  that  BR50  contains  1005  bps,  with  a  CG 
content  of  58%  (Fig.  4a).  The  DNA  fragment  has  a  GGCCGG 
sequence  on  one  end  and  CCGG  sequence  on  the  other  end.  Because 
Mspl  is  sensitive  to  GGCmCGG  and  not  sensitive  to  CCmGG  se¬ 
quences,  it  is  conceivable  that  the  GGCCGG  sequence  was  methyl¬ 
ated  in  normal  breast  tissue  DNA  while  being  hypomethylated  in 
tumor  DNA.  This  prediction  probably  holds  true  because  all  of  the 
candidate  probes,  including  probe  BR254  and  subsequent  probes 
isolated  from  breast  cancer  biopsies,  have  one  end  terminating  with  a 
GGCCGG  sequence  and  the  other  with  a  CCGG  sequence  (data  not 
shown).  A  homologue  search  of  the  NIH  GenBank  discovered  that 
BR50  is  not  homologous  to  any  existing  sequences.4  The  chromosome 
assignment  determined  by  monochromosomal  human/rodent  somatic 
cell  hybrid  mapping  panel  #2  established  its  location  on  chromosome 
3  (data  not  shown).  The  fine  chromosome  position  of  BR50  was 
analyzed  by  PCR  amplification  using  GeneBridge  4  Radiation  Hybrid 
Panel  as  templates.  The  top  and  bottom  strand  primers  for  BR50  are 
5 '  -  ACC  AG  ATGG  AGGC  AGTTGAC-3 '  and  5  '-AAGTGGGTGCT- 
CTTTCTGTG-3',  respectively.  The  result  obtained  from  radiation 
hybrid  mapping  suggests  that  BR50  is  placed  4.29  cR  below  the 
adjacent  STS,  AFMB362WB9,  which  is  179.84  cR  from  the  top  of  the 
chromosome  3  linkage  group.  This  result  confirmed  that  BR50  is 
proximally  located  on  3pl2-14. 

Search  for  the  Coding  Regions  by  Probe  BR50 .  On  the  basis  of 
our  preliminary  analysis,  BR50  was  an  interesting  probe  to  be  used  for 
an  adjacent  gene  search  because  it  was  differentially  methylated  in 
some  breast  and  ovarian  cancer  samples  tested,  although  the  sample 
size  was  small.  BR50  is  a  genomic  fragment,  and  consequently  the 
chances  that  it  encodes  a  polypeptide  are  slim  because  a  majority  of 
human  genomic  DNA  sequences  are  noncoding  sequences.  As  a 
result,  we  decided  that  the  first  step  in  searching  for  a  gene  should  be 
to  isolate  a  longer  DNA  piece  from  a  human  genomic  DNA  library, 


4  GenBank  accession  numbers:  BR50,  U78781;  TSP50,  AF100707. 
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a. 

CCGGAGGCAGGCACAGGACTCGGGAGGGACGCTGCCAGCTCTCTGGGTGCTGAGTTCACAAGGCTG 

CATTCATGATTTTCAATAGACCTGTGATGGTCTGTGCCCAGTGCTGGGGACACAGAAGAGTCAAAC 

CTGGCTCCTGACCTGGACCTGGATCATCACGTGACAGGGAGGAGAGCGATCCAGGCTGATGAGGAA 

AGCGCATGACATGGGGTCTTAGGAGCAGTGAGGGGCAGAGCCATGGCCAAAGGCCCCGCCATGGAA 

GCTGAGGACTCTGGCACCAGATGGAGGCAGTTGACGGACCTCTGCCCTTGGGGTCCAACCCATGGG 

CTTCTCATACATAGGGGTGAAAAAGGCCATTCTATTTATGCAGAATTTTCCCATGTGGCCAGGCAG 

CAGAAGTCCAGAGGGGTAGGGGCCACTCAGGGTCACACAGAACAGCAGTTGCTGAAGACTGGGGAA 

GTCCAGGCCTAGGCTCCACCTGCCCTTCCCCTGACATGGGGCCACCACTAGCCTTTTATGGGCAGG 

CCTGGCTGCTGGTGGTTGGAATAACATCTGACTCCAGTGGGTGTCTGTCACCGTCTCCAGACAGGA 

GACAGAGACAGAGGGTCAAAGTTCACTATGGCTCTTTGGGGCAATGAAATGCTGTGTTCTAGCCTC 

TTGCCAGAAATCAGCCAAAGTCAAGGAAAGCCTGACTCCCACAGTTATCACAGAAAGAGCACCCAC 

TTTCCAGCCCAGACAGCTGCACCCCAGCTGGGTCCTGGCAGCCCCAGCTTCAGCCTGGGCGGTATG 

TTCCAGGCCCCTCGATCATCTGACCCTAATATCACCCCTTCACACCCCCTCCACTTTCTGCGGGAG 

CCACCCCGAACCTTTGAATGGGGGAGATCCTGGAGGCTCTGCAATTTTCAGTGTAAACTGCCTGGA 

GTTCCCCACTTCACCCTCATCTGGTTCACCTGTGGACTCCCAACAGAGCAGGCCCAGGAAACGCGG 

GGCCTCTGAGGCCGG 


Fig.  4.  a,  the  sequence  of  the  genomic  DNA  probe  BR50.  b,  nucle¬ 
otide  and  predicted  amino  acid  sequences  of  the  human  TSP50  gene.  .The 
adenosine  at  the  ATG  (bold-type)  initial  codon  is  considered  the  number 
1  nucleotide.  The  stop  codon,  TGA,  is  also  in  boldface.  The  sequences 
of  exon  BR50-44  and  BR50-45  are  in  italics. 


-59  gtcgtgggggcggc 

actgggagcgccttccggagagacgcagtcggctgccaccccggg 

1  atgggtcgctggtgccagaccgtcgcgcgcgggcagcgcccccgg 
MGRWCQTVARGQRPR 

4  6  acgtctgccccctcccgcgccggtgccctgctgctgctgcttctg 
TSAPSRAGALLLLLL 
91  ttgctgaggtctgcaggttgctggggcgcaggggaagccccgggg 
LLRSAGCWGAGEAPG 
136  gcgctgtccactgctgatcccgccgaccagagcgtccagtgtgtc 
ALSTADPADQSVQCV 
181  cccaaggccacctgtccttccagccggcctcgccttctctggcag 
PKATCPSSRPRLLWQ 
226  accccgaccacccagacactgccctcgaccaccatggagacccaa 
TPTTQTLPSTTMETQ 
271  ttcccagtttctgaaggcaaagtcgacccataccgctcctgtggc 
FPVSEGKV  DU.P  Y  R  S  C  G 
316  ttttcctacgagcaggaccccaccctcagggacccagaagccgtg 
F  S  YEQDPTLRDPEAV 
3 61  gctcggcggtggccctgga tggtcagcgtgcgggccaatggca ca 
ARRWPWMVSVRANGT 
406  cacatctgt gccggca ccatcattgcctcccagt gggt gctgact 
HICAGTIIASQWVLT 
451  gtggcccactgcctga tcfcggcgtgatgttatctactcagtgagg 
VAHCLIWRDVIYSVR 
496  gtggggagtccgtggattgaccagatgacgcagaccgcctccgat 
VGSPWIDQMTQTASD 
541  gtcccggtgctccaggtcatcatgcatagcaggtaccgggcccag 
VPVLQVIMHSRYRAQ 
586  cggttctggtcctgggtgggccaggccaacgacatcggcctcctc 
RFWSWVGQANDIGLL 
631  aagctcaagcaggaactcaagtacagcaattacgtgcggcccatc 
KLKQELKYSNYVRP  I 
67  6  tgcctgcctggcacggactatgtgttgaaggaccattcccgctgc 
CLPGTDYVLKDHSRC 
721  actgtgacgggctggggactttccaaggctgacggcatgtggcct 
TVTGWGLSKADGMWP 
766  cagttccggaccattcaggagaaggaagtcatcatcctgaacaac 
QFRTIQEKEVIILNN 
811  aaagagtgtgacaatttctaccacaacttcaccaaaatccccact 
KECDNFYHNFTKIPT 
856  ctggttcagatcatcaagtcccagatgatgtgtgcggaggacacc 
LVQI  IKSQMMCAEDT 
901  cacagggagaagttctgctat gagctaactggagagcccttggtc 
HREKFCY  ELTGEPLV 
946  tgctccatggagggcacgtggtacctggtgggattggtgagctgg 
CSMEGTWYLVGLVSW 
991  ggtgcaggctgccagaagagcgaggccccacccatctacctacag 
GAGCQKSEAPPIYLQ 
1036  gtctcctcctaccaacactggatctgggactgcctcaacgggcag 
V  S  S  YQHWIWDCLNGQ 
1081  gccctggccctgccagccccatccaggaccctgctcctggcactc 
ALALPAPSRTLLLAL 
1126  ccactgcccctcagcctccttgctgccctctgactctgtgtgccc 
PLPLSLLAAL  * 

1171  tccctcacttg 
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with  the  hope  that  it  may  contain  exon(s).  It  is  well  known  that  DNA 
methylation  sites  can  be  near  genes  (18);  therefore,  we  decided  to 
screen  a  human  placenta  genomic  phage  library,  EMBL3  SP6/T7 
(Clontech,  Inc.),  where  the  insertion  sizes  were  relatively  small  (9-28 
kbp).  For  this  purpose,  2  X  106  plaques  derived  from  the  library  were 
screened  by  probe  BR50.  As  a  result,  a  17-kbp  length  clone  was 
isolated.  To  obtain  the  sequence  information,  the  DNA  clone  was 
released  from  the  phage  DNA  arms  by  restriction  enzyme  Sstl  cleav¬ 
age,  which  generated  eight  DNA  fragments.  All  eight  fragments  were 
subcloned  into  pUC118  plasmids.  The  fragments  smaller  than  1  kbp 
were  completely  sequenced,  whereas  the  fragments  larger  than  1  kbp 
were  partially  sequenced  from  both  the  plasmid  and  insert  junctions. 
A  homologue  search  of  the  NIH  GenBank  revealed  two  exons, 
BR50-44  and  BR50-45,  which  contained  112  and  132  nucleotides, 
respectively.  Both  exons  encoded  polypeptides  that  were  ~50%  iden¬ 
tifiable  to  several  mammalian  proteases,  such  as  serine  proteases  and 
tryptases.  The  BR50-45  sequence  was  found  142  bp  upstream  of  the 
BR50  sequence. 

Because  we  did  not  know  the  exact  positions  of  the  two  exons  in  the 
17-kbp  fragment,  it  was  possible  that  other  exon(s)  might  lay  between 
them.  To  gain  a  longer  coding  sequence,  we  designed  four  oligomers 
(A,  B,  C,  and  D),  based  on  the  sequence  information  of  both  exons,  to 
perform  PCR.  Oligomer  A  (5 ' -CCTGG ATGGTC AGCGTG-3 ' )  and  B 
(5 '  -CTGGGAGGC A ATG ATGGT-3 ' ),  which  were  on  the  compli¬ 
mentary  strand,  were  based  on  the  sequence  information  of  BR-44 ; 
and  C  (5 '  -CTGGAGAGCCCTTGGTCT-3 ')  and  D  (5'-CAGTGTTG- 
GTAGGAGGAG-3'),  which  were  on  the  complimentary  strand,  were 
based  on  the  sequence  information  of  BR-45.  A  strategy  using  four 
different  combinations  of  oligomer  pairs  was  used  to  perform  PCR  by 
using  the  Human  Universal  cDNA  Library  Panel  (Clontech,  Inc.).  A 
PCR  product,  which  was  about  700  bp  in  length,  was  generated  from 
one  oligomer  combination  (A/D,  5' -CCTGGATGGTC AGCGTG-3 7 
5  '-C AGTGTTGGTAGGAGGAG-3  ')■  This  PCR  product  was  directly 
sequenced.  Combining  the  sequence  information  of  the  PCR  product 
and  the  two  exons,  we  obtained  a  cDNA  fragment  that  contained  974 
bps.  The  DNA  homologue  search  of  the  NIH  GenBank  revealed  again 
that  it  coded  for  a  protease-like  protein,  and  the  overall  identity  was 
-40%. 

The  Candidate  Gene  Is  Highly  and  Specifically  Expressed  in 
Human  Testes.  To  obtain  a  full-length  cDNA,  it  is  critical  to  use  the 
right  cDNA  library  where  the  gene  of  interest  is  expressed.  Thus,  two 
Human  Multiple  Tissue  Northern  blot  panels,  MTN  and  MTN  II, 
containing  16  different  tissue  mRNAs  (Clontech,  Inc.),  were  used  to 
test  the  expression  of  the  candidate  gene  by  using  the  700-bp  cDNA 
PCR  product  as  a  probe.  The  results  showed  that  there  were  no  visible 
transcripts  of  this  gene  in  the  eight  mRNAs  (heart,  brain,  placenta, 
lung,  liver,  skeletal  muscle,  kidney,  and  pancreas)  included  in  the 
MTN  panel  (data  not  shown).  In  the  MTN  II  panel,  a  1.7-kbp  band 
was  heavily  hybridized  by  the  probe  exclusively  in  the  testes  mRNA 
as  compared  with  the  control  probe,  which  was  the  human  rab6  gene 
(Fig.  5).  These  results  suggested  that  the  gene  that  we  were  searching 
for  is  a  tissue-specific  gene.  We  have  named  the  gene  TSP50.  At  this 
moment,  the  biological  function(s)  of  the  gene  in  human  testes  re¬ 
mains  unknown. 

Isolation  of  the  Full-length  TSP50  Gene  from  a  Human  Testes 
cDNA  Library.  TSP50  is  highly  expressed  in  human  testes;  to  search 
for  the  full-length  TSP50  gene,  a  human  testes  cDNA  library  (Human 
Testis  cDNA  Library;  Clontech.  Inc.)  was  screened  by  the  700-bp 
TSP50  cDNA  sequence.  A  cDNA  clone  containing  the  probe  se¬ 
quence  was  isolated.  Sequence  analysis  suggests  that  this  fragment 
encodes  a  protein  with  385  animo  acids  (Fig.  4b).  There  is  a  stop 
codon  located  at  the  117th  bp  upstream  of  the  first  initial  translation 
site,  and  there  is  a  125-bp  untranslated  region  before  a  polyadenosine 
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Fig.  5.  Northern  blot  results  analyzed  by  the  700-bp  TSP50  fragment.  The  Human 
Multiple  Tissue  Northern  blot  panel,  MTN  II,  containing  the  mRNA  of  eight  different 
tissues  (Clontech  Inc.),  was  tested  to  determine  the  expression  levels  of  the  gene. 
Compared  with  the  control  (the  human  Rab6  gene,  which  was  evenly  expressed  in  all 
tissues),  TSP50  was  highly  expressed  in  the  testes  tissue  but  not  in  the  other  tissues. 


sequence.  These  results  imply  that  a  full-length  gene  has  been  ob¬ 
tained.  It  is  also  notable  that  the  BR50  probe  sequence  is  located  at  the 
3'  end  of  the  gene  and  is  only  17  bp  downstream  from  the  poly  ade¬ 
nosine  adding  signaling  site.  The  exons  BR50-44  and  BR50-45  encode 
animo  acids  from  103  to  157  and  308  to  385,  respectively  (Fig.  4.  b.). 
The  3'-  untranslated  region  before  the  polyadenosine  site  is  also 
included  in  the  sequence  of  BR50-45. 

DNA  Methylation  Status  of  the  TSP50  Gene  in  Human  Testes 
and  Other  Normal  Tissues.  Our  studies  have  proven  that  TSP50  is 
a  tissue-specific  gene,  and  the  methylation  patterns  in  its  3'  region 
were  altered  in  some  breast  and  ovarian  cancers.  It  is  also  known  that 
many  tissue  specific  genes  are  methylated,  and  this  methylation  may 
regulate  their  expression  (22-24).  To  explore  the  possible  relationship 
between  TSP50  gene  expression  and  DNA  methylation  in  different 
normal  human  tissues.  Southern  analysis  was  performed.  The  normal 
tissues  tested  included  the  testes,  where  TSP50  was  expressed,  and 
bladder,  blood,  breast,  colon,  lung,  kidney,  placenta,  and  ovary  sam¬ 
ples,  where  TSP50  was  apparently  not  expressed.  To  perform  the 
Southern  analysis,  BR50  was  used  as  a  probe.  DNAs  isolated  from  the 
nine  tissues  were  digested  by  Mspl  and  Hpall ,  which  is  an  isoschi- 
zomer  of  the  Mspl  enzyme  and  the  most  popular  enzyme  used  to  study 
DNA  methylation  patterns  (25).  Hpall  digestion  showed  that  in  the 
testes  DNA,  two  bands,  probably  released  from  each  allele  by  enzyme 
cleavage,  were  hybridized  by  the  probe.  However,  in  the  DNAs  of 
other  tissues,  the  corresponding  bands  were  either  not  hybridized  or 
hybridized  to  a  much  smaller  degree  (Fig.  6a).  For  Mspl  cleavage, 
both  bands  were  released  in  different  tissues  to  various  extents  (Fig. 
6b).  Both  blots  used  a  genomic  fragment  that  did  not  detect  differen¬ 
tial  DNA  methylation  as  a  control  to  determine  complete  enzymatic 
digestion  (Fig.  6).  These  results  demonstrated  that  the  TSP50  gene 
was  differentially  methylated  in  various  human  tissues.  In  general, 
DNA  demethylation  in  the  testes  is  correlated  with  high  levels  of  gene 
expression.  Conversely,  DNA  methylation  is  correlated  with  gene 
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Fig.  6.  DNA  methylation  status  of  the  TSP50  gene  in  nine  normal  human  tissues 
examined  by  Southern  blot.  In  a  and  b,  the  results  obtained  from  Hpall  and  Mspl 
digestion,  respectively,  are  shown.  Six  /u-g  of  DNA  isolated  from  each  tissue  were  cleaved 
by  the  enzymes  and  subjected  to  Southern  analysis  by  probe  BR50.  a,  the  results  show  that 
two  bands,  which  are  approximately  1  and  2  kbp  in  length,  were  released  by  Hpall  only 
in  the  testes  tissue,  b,  a  2-kbp  band  was  released  by  Mspl  in  most  tissues.  In  a  and  b,  the 
control  probe  hybridized  a  single  band  in  the  DNA  of  each  tissue,  which  provides  proof 
of  complete  enzymatic  digestion. 


silencing  in  the  bladder,  blood,  breast,  colon,  lung,  kidney,  placenta, 
and  ovary  tissues. 

TSP50  Was  Differentially  Expressed  in  Some  Breast  Cancer 
Tissues.  Preliminary  results  demonstrated  that  TSP50  was  differen¬ 
tially  methylated  in  40%  of  the  breast  cancer  tissues  tested.  This 
suggested  that  it  could  also  be  differentially  expressed  in  cancer.  To 
test  this  possibility,  RT-PCR  was  carried  out  to  determine  TSP50 
expression  levels  in  18  paired  breast  cancer  tissues.  Our  findings 
showed  that  TSP50  PCR  products  were  generated  in  five  tumor 
tissues,  whereas  in  their  normal  controls,  they  were  not  visible  relative 
to  the  control  gene,  fi-actin  (Fig.  7).  Products  generated  from  the  five 
patients  were  gel  purified  and  sequenced.  DNA  sequence  analysis 
confirmed  that  the  PCR  products  synthesized  were  the  TSP50  gene 
(data  not  shown).  Therefore,  among  the  18  paired  samples  tested,  28% 
of  the  tissues  expressed  the  TSP50  gene.  At  this  moment,  we  cannot 
answer  the  question  of  whether  activation  of  the  TSP50  gene  in  cancer 
is  a  consequence  or  a  causal  factor  of  neoplastic  growth.  To  find  the 
truth,  it  will  be  necessary  to  perform  in  vitro  cellular  transformation 
and  in  vivo  tumor  induction  assays. 


lated.  The  extensive  study  of  one  of  the  two  fragments,  BR50 ,  is  the 
subject  of  this  report. 

It  has  been  reported  that  aberrant  DNA  methylations  occur  con¬ 
stantly  in  human  tumors  (9-12,  24,  26).  DNA  hypomethylations 
could  activate  oncogenes,  whereas  DNA  hypermethylation  could  in¬ 
activate  recessive  oncogenes.  Both  events  could  result  in  neoplastic 
growth  (27-35).  The  correlation  between  aberrant  DNA  methylations 
and  malignancies  suggests  that  differentially  methylated  fragments  in 
tumors  isolated  by  a  modified  RDA  technique  could  be  a  valuable  tool 
in  the  search  for  genes  that  might  be  related  to  cancer  development. 
BR50  was  considered  to  hold  such  value  because  it  not  only  detected 
DNA  hypomethylations  in  the  original  breast  cancer  tissues  from 
which  it  was  isolated  but  also  detected  DNA  hypomethylations  in 
other  breast  and  ovarian  cancer  samples. 

Our  first  step  in  processing  the  gene  search  was  to  screen  a  human 
genomic  phage  library.  A  17-kbp  DNA  fragment  was  isolated,  and 
sequence  analysis  suggested  that  this  fragment  contained  at  least  two 
exons  that  were  homologous  to  mammalian  proteases.  The  sequencing 
information  of  the  exons  led  to  the  discovery  of  a  974-bp  gene 
fragment  from  a  human  cDNA  library  panel  by  PCR  amplification.  To 
obtain  a  full-length  gene,  a  Northern  evaluation  on  16  different  types 
of  human  RNAs  was  performed.  The  results  demonstrated  that  the 
target  gene  was  specifically  expressed  in  human  testes  tissue.  This 
information  secured  the  isolation  of  an  intact  gene,  TSP50 ,  by  screen¬ 
ing  a  human  testes  cDNA  library.  The  sequence  analysis  revealed  that 
the  TSP50  gene  encodes  a  protein  that  shares  ~40%  identity  with 
mammalian  proteases,  such  as  human  tryptase  or  mouse  serine  pro¬ 
tease.  This  would  suggest  that  the  product  of  the  TSP50  gene  is  a 
protease.  However,  at  this  point,  we  do  not  know  the  physiological 
function(s)  of  this  protease.  One  may  assume,  though,  that  it  could  be 
a  component  in  the  human  reproductive  pathway  due  to  it  being  solely 
expressed  in  the  testes. 

It  is  common  knowledge  that  the  expression  of  many  tissue-specific 
genes  is  regulated  by  DNA  methylations,  which  usually  modify  the 
promoter,  or  sometimes,  3'  regions  (3,  4,  22-24,  36).  Our  preliminary 
results,  although  only  obtained  from  analyzing  the  DNA  methylation 
status  of  the  3'  flanking  region  of  the  gene,  have  proven  that  TSP50 
is  one  of  those  tissue-specific  genes.  It  will  be  interesting  to  discover 
whether  the  promoter  region  of  the  gene  is  also  methylated  when  the 
corresponding  sequence  information  is  available.  The  Hpall  and  Mspl 
methylation-sensitive  Southern  analysis  of  the  3 '  region  of  the  TSP50 
gene  demonstrated  that,  in  //pall-digested  DNAs,  probe  BR50  hybrid¬ 
ized  two  bands  in  the  testes  tissue  but  none  in  the  other  samples.  The 
lower  band,  which  was  the  same  size  as  the  probe,  represented  the 
unmethylated  DNA  pattern,  whereas  the  upper  band  obviously  con- 
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DISCUSSION 

A  modified  RDA  technique  was  used  to  study  genetic  alterations  by 
using  breast  cancer  biopsies  as  a  working  model  system.  As  a  result, 
two  hypomethylated  genomic  DNA  fragments  were  successfully  iso- 


Fig.  7.  The  results  of  RT-PCR  for  TSP50  differentially  expressed  in  five  breast  cancer 
and  normal  control  tissues.  Lane  MW,  HaeWl  0174  markers  in  bp.  The  fi-actin  and  TSP50 
lanes  served  as  positive  controls;  they  were  generated  by  using  cDNA  prepared  from 
testes  tissue  RNA.  Lane  fi-actin+TSP50  contains  simultaneously  generated  TSP50  and 
fi-actin  from  testes  cDNA.  The  number  for  each  patient  tested  is  listed  above  the  bracket. 
T  and  N,  tumor  and  normal  tissues,  respectively. 
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tained  the  internal  Hpall  recognition  site(s),  which  remained  methy¬ 
lated.  In  Mspl-digested  DNAs,  the  upper  band  was  dominant  in  most 
tissues,  whereas  in  the  testes,  the  lower  band  was  dominant.  These 
results  suggest  that  the  GGCCGG  end  of  BR50  was  methylated  in 
other  tissues  but  not  in  the  testes.  The  DNA  methylation  patterns 
observed  in  both  blots  are  probably  allelic  orientated.  It  seems  that 
DNA  hypomethylation  was  accompanied  by  the  expression  of  the 
gene  in  the  testes,  and  conversely,  DNA  hypermethylation  was  ac¬ 
companied  by  the  silencing  of  the  gene  in  other  tissues.  The  correla¬ 
tion  between  DNA  methylation  and  gene  expression  provided  addi¬ 
tional  proof  that  DNA  methylation  could  be  an  important  mechanism 
in  governing  the  expression  of  the  genes  in  various  differentiated 
human  cells  (12,  37,  38).  In  addition,  the  differential  expression  of  the 
TSP50  gene  has  been  tested  in  18  paired  breast  cancer  biopsies.  Our 
findings  have  shown  that  this  gene  was  activated  in  five  cancer 
samples.  In  the  near  future,  more  samples  from  different  types  of 
cancer  will  be  examined,  and  the  possibility  that  the  TSP50  gene 
product  might  be  one  of  the  factors  that  stimulate  human  cancer  will 
be  further  explored. 

Recently,  by  using  the  same  technique,  DNA  fragments  that  rep¬ 
resent  DNA  amplifications,  deletions,  and  rearrangements  were  also 
obtained  (data  not  shown).  Hopefully,  this  technique  will  lead  to  the 
discovery  of  additional  novel  genes  that  may  be  related  to  cancer 
development.  On  the  basis  of  our  experience,  the  process  of  isolating 
the  TSP50  gene  was  made  considerably  easier  by  the  modified  tech¬ 
nology,  where  the  Mspl  enzyme  was  used  as  the  master  enzyme.  The 
ability  of  Mspl  to  recognize  GC-rich  sequences  and  its  sensitivity  to 
DNA  methylation  (17,  18)  apparently  accelerated  our  gene  search. 
Furthermore,  the  double  enzyme  cleavage  strategy  provides  another 
unique  and  efficient  feature  for  this  technique  because  ~40%  of  a 
human  genome  can  theoretically  be  analyzed  by  a  single  master 
enzyme  when  it  is  combined  with  a  different  partner  enzyme. 
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